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The idea that a group of cooperating agents can solve problems more efficiently than when those 
agents work independently is hardly controversial, despite our obliviousness of the conditions that 
make cooperation a successful problem solving strategy. Here we investigate the performance of 
a group of agents in locating the global maxima of NK fitness landscapes with varying degrees of 
ruggedness. Cooperation is taken into account through imitative learning and the broadcasting 
of messages informing on the fitness of each agent. We find a trade-off between the group size 
and the frequency of imitation: for rugged landscapes, too much imitation or too large a group 
yield a performance poorer than that of independent agents. By decreasing the diversity of the 
group, imitative learning may lead to duplication of work and hence to a decrease of its effective 
size. However, when the parameters are set to optimal values the cooperative group substantially 
outperforms the independent agents. 


I. INTRODUCTION 

The benefits of cooperative work were explored by na¬ 
ture well before the advent of the human species as at¬ 
tested by the collective structures built by slime molds 
and social insects [T]. In the human context, the socio- 
cognitive niche hypothesis purports that hominin evo¬ 
lution relies so heavily on social elements to the point 
of viewing a band of hunter-gatherers as a ‘group-level 
predator’ [2]. Whether physical or social, those struc¬ 
tures and organizations may be thought of as the or¬ 
ganisms’ solutions to the problems that endanger their 
existence (see, e.g., [S]) and have motivated the intro¬ 
duction of the concept of social intelligence in the sci¬ 
entific milieu [3]. In the computer science circle, that 
concept prompted the proposal of optimization heuris¬ 
tics based on social interaction, such as the popular par¬ 
ticle swarm optimization algorithm mm and the perhaps 
lesser-known adaptive culture heuristic mm- 

Despite the prevalence of the notion that cooperation 
can aid a group of agents to solve problems more effi¬ 
ciently than if those agents worked in isolation, and of the 
success of the social interaction heuristics in producing 
near optimal solutions to a variety of combinatorial op¬ 
timization problems, we know little about the conditions 
that make cooperative work more efficient than indepen¬ 
dent work. In particular, we note that since cooperation 
(or communication, in general) reduces the diversity or 
heterogeneity of the group of agents, it may actually end 
up reducing the efficiency of group work [Hilo]. Effi¬ 
ciency here means that the time to solve a problem scales 
superlinearly with the number of individuals or resources 
employed in the task. 

In this contribution we study the performance of a 
group of cooperative agents following a first-principle re¬ 
search strategy to study cooperative problem solving set 
forth by Huberman in the 1990s that consists of tack¬ 
ling easy tasks, endowing the agents with simple random 
trial-and-test search tools, and using plain protocols of 
collaboration [nnn]. Here the task is to find the global 


maxima of three realizations of the NK-fitness landscape 
characterized by different degrees of ruggedness (i.e., val¬ 
ues of the parameter K). We use a group of M agents 
which, in addition to the ability to perform individual 
trial-and-test searches, can imitate a model agent - the 
best performing agent at the trial - with a probability 
p. Hence our model exhibits the two critical ingredients 
of a collective brain according to Bloom: imitative learn¬ 
ing and a dynamic hierarchy among the agents [3]. The 
model exhibits also the key feature of distributed coop¬ 
erative problem solving systems, namely, the exchange of 
messages between agents informing each other on their 
partial success (i.e., their htness at the current trial) to¬ 
wards the completion of the goal m- 

We find that the presence of local maxima in the fit¬ 
ness landscape introduces a complex trade-off between 
the computational cost to solve the task and the group 
size M. In particular, for a fixed frequency of imitation 
p, there is an optimal value of M at which the com¬ 
putational cost is minimized. This finding leads to the 
conjecture that the efficacy of imitative learning could be 
a factor determinant of the group size of social animals 

m- 

Our study departs from the vast literature on cooper¬ 
ation that followed Robert Axelrod’s 1984 seminal book 
The Evolution of Cooperation m since in that game 
theoretical approach it is usually assumed a priori that 
mutual cooperation is the most rewarding strategy to 
the individuals. On the other hand, here we consider 
a problem solving scenario and a specific cooperation 
mechanism (imitation) aiming at determining in which 
conditions cooperation is more efficient than independent 
work. 

The rest of the paper is organized as follows. In Section 
[TT| we offer a brief overview of the NK model of rugged 
fitness landscapes. In Section |Hl] we present a detailed 
account of the imitative learning search strategy and in 
Section [rV] we study its performance on the task of finding 
the global maxima of three realizations of the NK land¬ 
scape with TV = 12 and ruggedness parameters K = 0, 
K = 2 and K = 4. The rather small size of the solution 
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space (2^^ = 4096 binary strings of length 12) allows the 
full exploration of the space of parameters and, in partic¬ 
ular, the study of the regime where the group size is much 
greater than the solution space size. Finally, Section [V| 
is reserved to our concluding remarks. 

II. NK MODEL OF RUGGED FITNESS 
LANDSCAPES 

The NK model of rugged fitness landscapes introduced 
by Kauffman [151 IE] offers the ideal framework to test 
the potential of imitative learning in solving optimization 
problems of varied degrees of difficulty, since the rugged¬ 
ness of the landscape can be tuned by changing the two 
integer parameters - N and K - of the model. Roughly 
speaking, the parameter N determines the size of the 
solution space whereas the value of K = 0,..., — 1 in¬ 

fluences the number of local maxima and minima on the 
landscape. 

The solution space consists of the 2^ distinct binary 
strings of length iV, x = {xi,X 2 ,. ■. ,xn) with Xi = 0,1. 
To each string we associate a fitness value $ (x) which is 
an average of the contributions from each component i 
in the string, i.e., 

1 ^ 

® (x) = ^ (x), (1) 

' i=l 

where (jii is the contribution of component i to the fitness 
of string x. It is assumed that (j)i depends on the state 
Xi as well as on the states of the K right neighbors of 
j, i.e., (pi = (pi {xi, Xi+i ,..., Xi+x) with the arithmetic in 
the subscripts done modulo N. In addition, the functions 
(pi are N distinct real-valued functions on {0,1}^’’’^. As 
usual, we assign to each (pi a uniformly distributed ran¬ 
dom number in the unit interval [T5l[T^. Hence 4) € (0,1) 
has a unique global maximum. 

For K = 0 there are no local maxima and the sole 
maximum of is easily located by picking for each com¬ 
ponent i the state = 0 if pi (0) > pi (1) or the state 
Xi = 1, otherwise. For K > 0, finding the global max¬ 
imum of the NK model is a NP-complete problem 
which essentially means that the time required to solve 
the problem using any currently known deterministic al¬ 
gorithm increases exponentially fast as the size N of the 
problem grows m- The increase of the parameter K 
from 0 to A — 1 decreases the correlation between the 
fitness of neighboring configurations (i.e., configurations 
that differ at a single component) in the solution space 
and for K = N — 1 those fitness values are uncorrelated 
so the NK model reduces to the Random Energy model 
[B IB- To illustrate the effect of varying K on the 
ruggedness of the fitness landscape, in Fig. we show 
the relative fitness of a string as function of its 

Hamming distance d to the global maximum for N = 25 
and different values of K. Here <l>g stands for the fitness 
of the global maximum. For each K, the figure shows the 



FIG. 1. (Color online) Normalized fitness as fnnction of the 
Hamming distance to the global maximnm for N = 25 and 
K = 0 [circles), K = 5 [triangles) and K = 20 [inverted 
triangles). The lines are gnides to the eye. 


results for a single realization of the fitness landscape and 
for a single trajectory in the solution space that begins at 
the maximum [d = 0) and changes the state components 
sequentially until all N states are reversed [d = N). 

We note that the ruggedness of the landscapes (es¬ 
sentially, the number of local maxima) can vary wildly 
between landscapes characterized by the same values of 
A and a: > 0 [BUI] and so can the performance of any 
search heuristic based on the local correlations of the fit¬ 
ness landscape. Hence in order to appreciate the role of 
the parameters that are relevant to imitative learning, 
namely, the group size M and the imitation probability 
p, for fixed N and K we consider a single realization of 
the NK fitness landscape. 


III. IMITATIVE LEARNING SEARGH 

We assume a group or system composed of M agents. 
Each agent operates in an initial binary string drawn at 
random with equal probability for the digits 0 and 1. In 
the typical situation that the size of the solution space 
is much greater than the group size 2^ ^ M we can 
consider those initial strings as distinct binary strings, 
but here we will consider the case that M ^ 2^ as well, 
so that many copies of a same string are likely to appear 
in this initial stage of the simulation. In addition, we 
assume that the agents operate in parallel. 

At any trial t, each agent can choose with a certain 
probability between two distinct processes to operate 
on the strings. The first process, which happens with 
probability I — p, is the elementary or minimal move in 
the solution space, which consists of picking a string bit 
i = 1,..., N at random with equal probability and then 
flipping it. The repeated application of this operation is 
capable of generating all the 2^ binary strings starting 
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from any arbitrary string. The second process, which 
happens with probability p, is the imitation of a model 
string. We choose the model string as the highest fitness 
string in the group at trial t. The string to be updated 
(target string) is compared with the model string and 
the different digits are singled out. Then the agent se¬ 
lects at random one of the distinct bits and flips it so 
that the target string becomes now more similar to the 
model string. The parameter p € [0,1] is the imitation 
probability and the case p = 0 corresponds to the base¬ 
line situation in which the M agents explore the solution 
space independently. 

The specific imitation procedure proposed here was in¬ 
spired by the mechanism used to model the influence of 
an external media in the celebrated agent-based 

model proposed by Axelrod to study the process of cul¬ 
ture dissemination [25] . It is important to note that in 
the case the target string is identical to the model string, 
and this situation is not uncommon since the imitation 
process reduces the diversity of the group, the agent ex¬ 
ecutes the elementary move with probability one. This 
procedure is different from that used in m, in which 
strings identical to the model string are not updated 
within the imitation process. However, both procedures 
yield qualitatively similar results, except in the regime 
where imitation is extremely frequent, i.e., for p « 1. In 
particular, in the present implementation, a small group 
can find the global maximum in the case p = 1 since the 
model string can explore the solution space through the 
elementary move whereas the other strings are simply 
followers. 

The collective search ends when one of the agents finds 
the global maximum and we denote by t* the number of 
trials made by the agent that found the solution. Of 
course, t* stands also for the number of trials made by 
any one of the M agents, since they operate in paral¬ 
lel and the search halts simultaneously for all agents. 
In other words, the trial number t is updated, or more 
specifically, incremented by one unit, when the M agents 
have executed one of the two processes on its associated 
string. We note that except for the case p — 0, the up¬ 
date of the M agents is not strictly a parallel process 
since the model strings may change several times within 
a given trial. Nonetheless, since in a single trial all agents 
are updated, the total number of agent updates at trial 
t is given by the product Mt. 

The efficiency of the search strategy is measured by 
the total number of agent updates necessary to find the 
global maximum (i.e., Mt*) which can then be inter¬ 
preted as the computational cost of the search. In ad¬ 
dition, since we expect that the typical number of trials 
to success t* scales with the size of the solution space 
2^, we will present the results in terms of the rescaled 
computational cost, defined as C = Mt*/2^. 

An interesting variant of our imitation rule is obtained 
by relaxing the condition that only the string with the 
highest fitness value can be imitated and allowing any 
string to pose as a model according to an imitation prob- 
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FIG. 2. (Color online) Single-agent mean rescaled computa¬ 
tional cost (C) as function of the binary string length N that 
determines the size of the solution space 2^. These results, 
which were obtained by the numerical calculation of the eigen¬ 
values of the tridiagonal stochastic matrix T, do not depend 
on the ruggedness of the landscape. 
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ability function proportional to the string relative fitness 
(e.g., a Fermi function). Such fitness dependent imitation 
probability functions are frequently used in the study 
of cooperation dilemmas [Ml Ell and may be useful to 
prevent the string population being trapped in the local 
maxima. In this paper, however, we will consider only 
the noiseless or zero-temperature limit of those imitation 
functions, where only the string with the maximum rel¬ 
ative fitness can be imitated. 


IV. RESULTS 


Here we report the results of extensive Monte Carlo 
simulations of groups of agents that use imitative learn¬ 
ing to search for the global maxima of three representa¬ 
tive realizations of the NK fitness landscape. For fixed 
N and K we generate a single realization of the fitness 
landscape and carry out 10® searches in order to deter¬ 
mine the dependence of the mean rescaled computational 
cost (C) on the imitation probability p and on the group 
size M. In addition, for the baseline case in which the 
M agents explore the landscape independently {p = 0) 
we derive an analytical expression for the computational 
cost. 


A. Independent search 

In this case there is no imitation (p = 0) and the 
ruggedness of the landscape has no effect on the per¬ 
formance of the search, which depends only on the size 
of the solution space, 2^. Because of this independence 
on the landscape we can derive exact results for the time 
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needed for M independent agents to find the global max¬ 
imum. For this analysis we can assume that the global 
maximum is the string with the N digit values equal to 
1, i.e., (1,1,..., 1), without loss of generality. 

Let us consider first the case of a single agent that 
operates on a string with j digits 1. According to the 
elementary move, the probability that the resulting string 
has i digits 1 is 

T,, = (1 - j/N) + j W,,_i (2) 

for j = - 1, Tio = Si^i, and = Si^N, 

where Sij is the Kronecker delta. Since 0 < T^j < 1 
and 1 for j = 0,..., the matrix T is a 

tridiagonal stochastic matrix. The associated stochas¬ 
tic process has a single absorbing state i = N and 
TT^v (t) = {t — 1) yields the probability that 

the agent finds the solution before or at trial t. For large 
t this quantity is given ttjv (t) = 1 — (Aw)* where Aw is the 
second largest eigenvalue of T. (The largest eigenvalue 
is 1 because T is a stochastic matrix.) The probability 
that the agent finds the solution exactly at trial t = t* is 
Pn (t*) = TTw {t*) - TTw (t* - 1), i.e.. 



FIG. 3. (Color online) Mean rescaled computational cost (C) 
as function of the group size M for the imitation probability 
p — 0 [circles), p = 0.5 (triangles), p — 0.8 (inverted trian¬ 
gles), and p = 1 (sguares). The solid curve is eq. Q and 
the dashed line is the linear function (C) = M/2^. The pa¬ 
rameters of the NK landscape are A = 12 and K — 0. The 
landscape exhibits a single maximum. 


Pn (t*) = (XnY ^ (1 — Aw) , (3) 


which is a geometric distribution with success probabil¬ 
ity 1 — Aw- The value of Aw can be easily obtained 
numerically (see, e.g., [28]) provided N is not too large 
since 2-^^ (1 — Aw) is on the order of the unity and so it 
becomes practically impossible to distinguish Aw from 
1 for A > 30. The mean rescaled computational cost 
(C) = 1/ [2^^ (1 — Aw)] is shown in Fig. I^for different 
string lengths N. (Recall that M = 1 at this stage.) In 
particular for A = 12 we find A 12 « 0.99978 that implies 
(C) « 1.11612. For very large A, the probability of suc¬ 
cess becomes 1 — Aoo = 1/2^. We note that although eq. 
(§ is valid strictly for large t* only, the fact that t* is 
typically on the order of 2^ makes this geometric distri¬ 
bution an exceedingly good approximation to the correct 
distribution of absorbing times. 

Now let us consider the case of M = 2 agents search¬ 
ing independently for the global maximum. Since the 
process halts when one of the agents finds the global 
maximum, the halting time is ^2 = cciin{t*,t^} where 
f* and are independent random variables distributed 
by the geometric distribution (|^ . It is easy to show that 
the distribution of ^2 is also geometric with probability 
of success 1 — (Aw) [29]. In the case of M = 3 agents the 
halting time tg = min{t 2 ,ta} and since both ^2 and t* 
are geometrically distributed, though with distinct suc¬ 
cess probabilities, we find that tg is also geometrically 
distributed with probability of success 1 — (Aw) • The 
generalization of this reasoning to M agents yields that 
the mean scaled cost is 


(C) 


M 


2N 


1 — (Aw) 


M 


(4) 


Since Aw is close to 1 we can write (Xn)^ ~ e 
so that (C) « 1/ [ 2 ^ (1 — Aw)] for M ^ 1/ (1 — Aw) and 
(C) « M/2^ for M ^ 1/ (1 — Aw)- We recall that for 
A = 12 we have 1/ (1 — A 12 ) « 4545- As expected, eq- 
Q matches the simulation data very well (see Fig. |^. 


B. Search on a Smooth Landscape (K = 0) 

The NK fitness landscape with K = 0 exhibits a single 
global maximum and no local maxima. The results of 
the performance of the imitative search for a landscape 
with A = 12 and K = 0 are summarized in Fig. 

As shown in the previous subsection, the mean com¬ 
putational cost for non-interacting agents (p = 0) is a 
constant (C) « 1.12 provided the group size M is not 
too large compared to 1/ (1 — A 12 ) « 4545, which is close 
to the size of the solution space, 2^^ = 4096. In the case 
the group size M is very large, the agents begin to du¬ 
plicate each other’s work leading to a linear increase of 
(C) with increasing M. More pointedly, in this regime 
we find (C) = M/2^ (see the straight line in Fig. [^. We 
stress that a constant computational cost means that the 
time t* the group requires to find the global maximum 
decreases with the inverse of the group size (i.e., the time 
to solve the problem decreases linearly with the number 
of agents), whereas a computational cost that grows lin¬ 
early with the group size means that t* does not change 
as more agents are added to the group. 

Allowing the agents to imitate the best performer (mo¬ 
del) at each trial leads to a rapid reduction of the compu¬ 
tational cost provided the group size remains small. The 
best performance is achieved for M = 5 and p = 1 and 
corresponds to a fiftyfold decrease of the computational 
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TABLE I. Local maxima for the studied realization of the NK 
fitness landscape with = 12 and K = 2 


Maximum 

4>/4>, 

d 

1 

0.7681 

5 

2 

0.9114 

5 

3 

0.9204 

3 

4 

0.9801 

3 


cost with respect to the independent search (p = 0). The 
fact that the best performance is obtained when the im¬ 
itation probability is maximum is due to the absence of 
local maxima in the landscape for K = 0. We recall that 
for p = 1 only the model string, which is likely to be 
represented by several copies in the group, can perform 
the elementary move; all other strings must imitate the 
model. As a result, the effective size of the search space is 
greatly reduced, i.e., the strings are concentrated in the 
vicinity of the model string which cannot accommodate 
many more than M = 8 strings without duplication of 
work. This is the reason we observe the degradation of 
the performance when the group size increases beyond its 
optimal value. Note that for AT = 0 the imitative learn¬ 
ing search always performs better than the independent 
search. 


C. Search on a Slightly Rugged Landscape {K — 2) 

Now we consider a somewhat more complex NK fit¬ 
ness landscape by setting = 12 and K — 2. In the 
particular realization of the landscape studied here there 
are 5 maxima in total, among which 4 are local maxima. 
The relative fitness of those maxima (<I>/<i)g), as well as 
their Hamming distances to the global maximum (d), is 
presented in Table |T] 

The results of the imitative learning search are sum¬ 
marized in Fig.j^where the mean rescaled computational 
cost is plotted against the group size for different values of 
the imitation probability. The performance of the inde¬ 
pendent search {p = 0) is identical to that shown in Fig. 
1^ for AT = 0, since that search strategy is not affected 
by the complexity of the landscape. The results for the 
cooperative search {p > 0), however, reveal a trade-off 
between the group size M and the imitation probability 
p. In particular, for p > 0.8 we observe a steep increase 
of the computational cost for intermediate values of M. 
This happens because the group can be trapped near one 
of the local maxima. For large groups {M > 100 in this 
case), chances are that some of the initial strings are close 
to the global maximum and end up attracting the rest of 
the group to its neighborhood, thus attenuating the ef¬ 
fect of the local maxima. The robust finding is that for 
any p > 0 there is an optimal group size, which depends 
on the value of p, that minimizes the computational cost 
of the search. 



FIG. 4. (Color online) Mean rescaled computational cost (C) 
as function of the group size M for the imitation probability 
p = 0.5 [triangles), p = 0.8 [inverted triangles) , p = 0.95 
[squares) and p = 0.99 [circles). The solid line is the linear 
function (C) = M/2^. The parameters of the NK landscape 
are A = 12 and K = 2. The landscape exhibits 4 local 
maxima and a single global maximum. 



FIG. 5. (Color online) Time evolution of the relative fitness of 
the model string for M = 3 agents and probability of imitation 
p = 0.99. The thin horizontal lines indicate the relative fitness 
of the 4 local maxima given in Table |T] The parameters of the 
NK landscape are A = 12 and K = 2. 


A better understanding of the dynamics of the search 
is offered in Fig. which shows the relative fitness of 
the model string as function of the number of trials for 
M = 3 and p = 0.99. The role of the local maxima 
as transitory attractors of the search is evident in this 
figure. A typical run with M = 10 agents, which is ap¬ 
proximately the location of the maximum of (C) in Fig. 
1^ yields essentially a flat line at the highest fitness local 
maximum (maximum 4 in Table|T| and a sudden jump to 
the global maximum. Because there are many copies of 
the model string (the mean Hamming distance between 
the M = 10 strings is less than 1), the variants produced 
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FIG. 6. (Color online) Mean rescaled computational cost (C) 
as function of the group size M for the imitation probability 
p — 0.1 (circles), p — 0.3 (squares), p — 0.5 (triangles) and 
p = 0.8 (inverted triangles). For p = 0.8 we find (C) > 50 for 
M > 12. The solid line is the linear function (C) = M/2^. 
The parameters of the NK landscape are = 12 and K = A. 
The landscape exhibits 52 local maxima and a single global 
maximum. 

by the elementary move are attracted back to the local 
maximum. This is the reason why a small group can 
search the solution space much more efficiently than a 
large one in the case imitation is very frequent. 


D. Search on a Rugged Landscape (K = 4) 

The particular realization of the NK fitness landscape 
with = 12 and K = A that we consider now has 53 
maxima, including the global maximum, which poses a 
substantial challenge to any hill-climbing type of search 
strategy. As in the previous case, we observe in Fig. 
a trade-off between M and p, but now the results reveal 
how imitative learning may produce disastrous perfor¬ 
mances on rugged landscapes for certain ranges of those 
parameters. The strategy of following or imitating a 
model string can backfire if the fitness landscape exhibits 
high fitness local maxima that are distant from the global 
maximum. A large group may never escape from those 
maxima due to the attractive effect of the clones of the 
model string. This is what we observed in the case of 
p = 0.8 for which we were unable to find the global max¬ 
imum with groups of size M > 12. 

We note, however, that for a fixed group size M it 
is always possible to tune the imitation probability p so 
that the imitative learning strategy performs better than 
(or, in a worst-case scenario, equal to) the independent 
search. This point is illustrated in Fig. that shows the 
computational cost as function of p. For any fixed value 
of M, the computational cost exhibits a well-defined min¬ 
imum that determines the value of the optimal imitation 
frequency. As hinted in the previous figures, this optimal 


FIG. 7. (Golor online) Mean rescaled computational cost (C) 
as function of the probability of imitation p for the group 
size M = 5 (solid line), M = 10 (dashed line) and M = 100 
(dotted line). The parameters of the NK landscape are N = 
12 and K = A. The landscape exhibits 52 local maxima and 
a single global maximum. 

value decreases with increasing group size. 

In order to verify the robustness of our findings, which 
were obtained for specific realizations of the NK fitness 
landscape, we have considered four random realizations 
of the landscape with = 12 and AT = 4 in addition to 
the realization studied above. The comparison between 
the mean computational costs to find the global maxima 
of the five realizations of the landscape is shown in Fig. 
I^for the imitation probability p = 0.5. The results are 
qualitatively the same, despite the wild fluctuations of 
(C) in the regime where the search is trapped in the 
local maxima. It is reassuring to note that the initial 
decrease of the mean cost with increasing group size and 
the existence of an optimal group size that minimizes 
that cost, which are exhibited by all five realizations of 
the NK landscapes shown in Fig.|^ are robust properties 
of the imitative learning search. 

V. DISCUSSION 

In this paper we study quantitatively the problem solv¬ 
ing performance of a group of agents capable to learn by 
imitation. The performance measure we consider is the 
computational cost to find the global maximum of three 
specific realizations of the NK fitness landscape. The 
computational cost is defined as the product of the num¬ 
ber of agents in the group and the number of attempted 
trials till some agent finds the global maximum. Our 
main conclusion, namely, that for a fixed probability of 
imitation p there is a value of group size that minimizes 
the computational cost corroborates the findings of a sim¬ 
ilar study in which the task was to solve a particular 
cryptarithmetic problem m- Hence our conjecture that 
the efficacy of imitative learning could be a factor deter- 
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FIG. 8. (Color online) Mean rescaled computational cost {C) 
for the probability of imitation p = 0.5 as function of the 
group size M for five realizations (different symbols) of the 
NK fitness landscape with = 12 and K = A. The solid line 
is the linear function (C) = . 

minant of the group size of social animals (see [mSO] for 
a discussion on the standard selective pressures on group 
size in nature). We note that in the case the connectivity 
between agents is variable, i.e., each agent interacts with 
L = 1,..., M — 1 distinct randomly picked agent (here we 
have focused on the fully connected network L = M — 1 
only) then there is an optimal connectivity value that 
minimizes the computational cost |13] . It would be most 
interesting to understand how the network topology in¬ 
fluences the performance of the group of imitative agents 
and how the optimal network topology correlates with 
the known animal social networks m- 

The main aim of our contribution is to show that the 
existence of an optimal group size that maximizes per¬ 
formance for imitative learning m is insensitive to the 
choice of the fitness landscape, so it is likely a robust 
property of populations that use imitation as a coopera¬ 
tive strategy. Although we have focused on the effect of 
the parameter K, which roughly determines the rugged¬ 
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ness of the NK landscape, the parameter N (the length 
of the strings) also plays a relevant role on the search for 
the global maximum, besides the obvious role of fixing 
the size 2^ of the search space. (Of course, since the 
typical time to find the global maximum scales with 2^, 
even moderate values of N, say N = 16, would render 
the simulations unfeasible.) The nontrivial role is that 
the value of N seems to pose an upper bound to the op¬ 
timal size of the group M in the regime that imitation is 
very frequent (see Figs. |^|^and[^. This is so because 
in that regime the strings are concentrated in the close 
vicinity of the model string, which cannot accommodate 
more than M = N different strings. 

We do not purport to offer here any novel method to 
explore efhciently rugged landscapes, but the finding that 
for small group sizes imitative learning decreases consid¬ 
erably the computational cost of the search, even in a 
very rugged landscape (see data for p = 0.8 in Fig.|^ mo¬ 
tivates us to address the question whether in such land¬ 
scapes that strategy could achieve a better-than-random 
performance for all group sizes. This is achieved auto¬ 
matically for smooth landscapes (see Figs. and but 
not for rugged ones (see Fig. and m)- Clearly, the 
way to accomplish this aim is to decrease the probability 
of imitation p as the group size M increases, following 
the location of the minima shown in Fig. It is inter¬ 
esting to note that the finding that too frequent interac¬ 
tions between agents may harm the performance of the 
group (see Fig. may offer a theoretical justification for 
Henry Ford’s factory design in which the communication 
between workers was minimized in order to maintain the 
established efficiency and maximal productivity |32j . 
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